A Novel Method For Speech Segmentation Based On Speakers' Characteristics

Authors

Behrouz Abdolali

Hossein Sameti

Abstract

Speech segmentation is the process of change-point detection that partitions an input audio stream into regions, each of which corresponds to only one audio source or one speaker. One application is in speaker diarization systems. There are several methods for speaker segmentation; however, most speaker diarization systems use BIC-based segmentation methods. The main goal of this paper is to propose a new method for speaker segmentation that is faster than current methods such as BIC while keeping acceptable accuracy. Our proposed method is based on the pitch frequency of the speech. Its accuracy is similar to that of common speaker segmentation methods, but its computational cost is much lower. We show that our method is about 2.4 times faster than the BIC-based method, while the average accuracy of the pitch-based method is slightly higher than that of the BIC-based method.
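For orientation, the BIC baseline mentioned in the abstract can be sketched as follows. This is a minimal illustration of the standard ΔBIC change-point test, not the authors' implementation and not their pitch-based method. It assumes a one-dimensional feature per frame (for example a pitch value in Hz) modeled by a single Gaussian, whereas practical systems typically use multivariate features with full covariance matrices. The function names, the `margin` parameter, and the synthetic data are all hypothetical.

```python
import math
import random


def variance(xs):
    """Maximum-likelihood variance, floored so that log() stays finite."""
    mean = sum(xs) / len(xs)
    return max(sum((x - mean) ** 2 for x in xs) / len(xs), 1e-12)


def delta_bic(xs, i, lam=1.0):
    """Delta-BIC for splitting xs at index i (1-D Gaussian models).

    A positive value favors two separate Gaussians (a change point at i)
    over one Gaussian for the whole window.
    """
    n, n1, n2 = len(xs), i, len(xs) - i
    # Penalty for dimension d = 1: 0.5 * (d + d*(d+1)/2) * log(n) = log(n).
    penalty = lam * math.log(n)
    return (0.5 * n * math.log(variance(xs))
            - 0.5 * n1 * math.log(variance(xs[:i]))
            - 0.5 * n2 * math.log(variance(xs[i:]))
            - penalty)


def best_change_point(xs, margin=10, lam=1.0):
    """Return (index, score) of the best split, or None if no score > 0.

    margin keeps candidates away from the window edges so that both
    sides have enough frames for a variance estimate (assumed value).
    """
    candidates = range(margin, len(xs) - margin)
    if not candidates:
        return None
    best = max(candidates, key=lambda i: delta_bic(xs, i, lam))
    score = delta_bic(xs, best, lam)
    return (best, score) if score > 0 else None


# Synthetic example: two "speakers" with different average feature values.
rng = random.Random(0)
track = ([rng.gauss(120, 10) for _ in range(200)]
         + [rng.gauss(210, 15) for _ in range(200)])
result = best_change_point(track)
```

Every candidate index requires re-estimating the model on both sides of the split, which illustrates why an exhaustive BIC search can be costly on long recordings.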

Similar resources

A Novel Spot-Enhancement Anisotropic Diffusion Method for the Improvement of Segmentation in Two-dimensional Gel Electrophoresis Images, Based on the Watershed Transform Algorithm

Introduction: Two-dimensional gel electrophoresis (2DGE) is a powerful technique in proteomics for protein separation. In this technique, spot segmentation is an essential stage, which can be challenging due to problems such as overlapping spots, streaks, artifacts and noise. Watershed transform is one of the common methods for image segmentation. Nevertheless, in 2DGE image segmentation, the no...

Word segmentation in Persian continuous speech using F0 contour

Word segmentation in continuous speech is a complex cognitive process. Previous research on spoken word segmentation has revealed that in fixed-stress languages, listeners use acoustic cues to stress to de-segment speech into words. It has been further assumed that stress in non-final or non-initial position hinders the demarcative function of this prosodic factor. In Persian, stress is retract...

Improving Brain Magnetic Resonance Image (MRI) Segmentation via a Novel Algorithm based on Genetic and Regional Growth

Background: Given the importance of correct diagnosis in medical applications, various methods have been exploited so far for processing medical images. Segmentation is used to analyze anatomical structures in medical imaging. Objective: This study describes a new method for brain Magnetic Resonance Image (MRI) segmentation via a novel algorithm based on genetic and regiona...

UNIVERSITY OF WEST BOHEMIA IN PILSEN, DEPARTMENT OF CYBERNETICS A Method for Speaker-Based Segmentation of Audio Signals

The paper deals with the problem of speaker-based segmentation. The goal of this task is to extract homogeneous segments containing the longest possible utterances produced by a single speaker. In the method presented here, no assumption is made about prior knowledge of the speaker or speech signal characteristics (there is no speaker model, no speech model, even the number of speakers in the r...

Mixed-lingual spoken word recognition by using VQ codebook sequences of variable length segments

We are investigating unsupervised phone modeling. This paper describes a derivation method of VQ codebook sequences of variable length segments from spoken word samples, and also describes evaluation results by applying the method to mixed-lingual speech recognition tasks which include non-native speakers. The VQ codebook is generated based on a piecewise linear segmentation method which includ...

Journal:

CoRR

Volume abs/1205.1794

Publication date: 2012
